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ABSTRACT 

Infrared Dark Clouds (IRDCs) are dark clouds seen in silhouette in mid-infrared surveys. They are 
thought to be the birthplace of massive stars, yet remarkably little information exists on the properties 
of the population as a whole (e.g. mass spectrum, spatial distribution). Genetic forward modelling is 
used along with the Two Micron All Sky Survey and the Besangon Galactic model to deduce the three 
dimensional distribution of interstellar extinction towards previously identified IRDC candidates. This 
derived dust distribution can then be used to determine the distance and mass of IRDCs, independently of 
kinematic models of the Milky Way. Along a line of sight that crosses an IRDC, the extinction is seen to 
rise sharply at the distance of the cloud. Assuming a dust to gas ratio, the total mass of the cloud can be 
estimated. 

The method has been successfully applied to 1259 IRDCs, including over 1000 for which no distance 
or mass estimate currently exists. The IRDCs are seen to lie preferentially along the spiral arms and in 
the molecular ring of the Milky Way, reinforcing the idea that they are the birthplace of massive stars. 
Also, their mass spectrum is seen to follow a power law with an index of —1.75 ± 0.06, steeper than giant 
molecular clouds in the inner Galaxy, but comparable to clumps in GMCs. This slope suggests that the 
IRDCs detected using the present method are not gravitationally bound, but are rather the result of density 
fluctuations induced by turbulence. 

Subject headings: dust, extinction - Galaxy: structure - ISM: clouds - infrared: ISM 



1. Introduction 

The first repo rted detection o f Infrared Dark Clouds 
(IRDCs) was bv IPeraultld 1996b . who noted regions in 
mid-infrared (MIR) ISO CAM observation s that were 
optically thick at 15 fim. lEgan et alj Ql_998) noted that 
many of these dark features do not emit significantly 
in the far-infrared, implying that they are cold (< 20 
K) and dense (n > 10 5 cm 3 ) molecular clouds. 

There has been much interest recently in IRDCs, 
as they are thought to be the initial phase of mas- 
sive star formation. Indeed, active star formation 
has been detected in a number of dense IRDCs 



dOrmel et al.ll2005t iRathbome et alj |2005l IPillai et al 
2006; Chamber s et alj|2009l) . In order to obtain more 
informati on on the clou ds themselves as a popula- 
tion, [simonetaL ( 2006ah analysed MIR images from 
the Midcourse Space Experiment (MSX. IPricdl 1995b . 
They searched for contiguous regions of high decre- 
mental contrast with respect to the MIR background 
and they identified over 10000 IRDC candidates in 
the first an d fourth Galactic qua drants. In a subse- 
quent study, ISimon etaL (2006b) used 13 CO observa- 
tions to determine kinematic distances to and masses 
of ove r 300 of the IRDC candidates. Rath borne et alj 
(2006) used the kinematic distances determined by 
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Simon et al (2006b) along with dust continuum ob- 
servations and determin ed the mass e s of nearly 40 
IRDCs. More recently, Ijackson et all d2008h used CS 
observations to determine the kinematic distances to 
over 300 4th quadrant IRDCs. The typical size of 
the IRDCs found by the above authors is 5 pc, their 
average mass is ~ 3 x 10 3 M Q , and their distances 
indicate that they l ie preferentially in the molecular 



ring o f the Galaxy (Sim on et alJl2006bt Ijackson et al. 



2008). The mass spectra for the IRDCs is found to 
follow a power law, with a spectral index steeper than 
that fo und for Giant Molec ular Clouds in the inner 
Galaxy dSimon et aL 2006bl) . The cores within the 
IRDCs, however, a re steeper still and are very close 
to the Salpeter IMF dRathborne et al.|[2Q06h . 

The area of the Galaxy where the detection of 
IRDCs is easiest, namely inside the solar circle due to 
the high MIR background, is also the region where dis- 
tance determinations are the most problematic. Tack- 
ling the problem from a variety of angles is therefore 
very important in order to avoid introducing bias. In 
order to extend and complement the latest findings on 
the Galactic distribution of IRDCs, a new tridimen- 
sional extinction mapping method is used to locate 
their positions and line of sight extinctions. Stars ob- 
served in the near infrared are compared to a Ga lactic 
Population Synthesis model (Ro bin et alj 120031) . and 
the extinction required to redden the modelled stars to 
match the observed stars is f ound via a genetic algo- 
rithm ( G A. ICharbonneaul 1 9951) . Near infrared data are 
well suited to study these IRDC candidates as K s band 
extinction is onl y a factor of two higher than the ex- 
tinction at 8 fim dlndebetouw et al.ll2005l) . 

In section|2]we describe the observations and model 
used to calculate the extinction. In section [3] we de- 
scribe how we determine the line of sight extinction 
to IRDC candidates, and how this information is con- 
verted to a distance and mass estimate for the cloud. 
We describe the results in section [4] and discuss the 
implications in section[5] We conclude in section[6] 

2. Model and observations 
2.1. Two Micron All Sky Survey 



The Two Micron All Sky Survey (2MASS. lSkrutskie et al 
2006) is a ground based survey which uniformly 



scanned the entire sky in three near-infrared bands ( J, 
H & K s ). Amongst its final products is the point source 
catalogue (PSC) which includes point sources brighter 
than about 1 mJy in each band, with signal-to-noise 



ratio (SNR) greater than 10, using a pixel size of 2.0". 
It is complete down to a limiting magnitude of 15.7, 
15.1 and 14.3 in the J, H and K s bands, respectively, 
and in the absence of confusion. 

2.2. The Galactic model 



The stellar population synth esis model of th e 
Galaxy constructed in Besangon (Ro bin et al] 120031) . 
hereafter called the Galactic model, is able to simulate 
the stellar content of the Galaxy by modelling four 
distinct stellar populations: the thin disc, the thick 
disc, the outer bulge and the spheroid. It also takes 
into account the dark halo and a diffuse component 
of the interstellar medium. It can be used to generate 
stellar catalogues for any given direction, and returns 
information on each star such as magnitude, colour, 
and distance as well as kinematics and other stellar 
parameters. 

The approach of the Galactic model is semi- 
empirical as it is based on theoretical grounds (for 
example stellar evolution, galactic evolution and galac- 
tic dynamics) and is constrained by empirical obser- 
vations (the local luminosity function for example). 
The Galactic potential is calculated in order to self- 
consistently constrain the disc scale height. In ad- 
dition, the model simulates photometric errors and 
includes Poisson noise to make it ideal for direct com- 
parison with observations. 

The Galactic model has been used for various 
studies, such as identification of Galactic struc- 



tures through stellar overdensities (Picaud et al. 2003 
Bellazzini et al]|2004t iMomanv et alj|2004 . Galactic 



para meter estimation from fitting model to o bserva- 
tions dPicaud & Robinl2004tlRevle et alj20"09b as well 
as the qua ntification of the uncertainty i n extinction es- 
timation (Froebrich & del Burgol l2006) to name but a 
few^ Many more examples are given in iRobin et al. 



(2003) 



2.3. Comparing model to observations 

The Galactic model has been developed to return 
results in the near-infrared and visible filters, and is 
a powerful tool to extract the extincti on information 



embedded in the 2MASS observations (Marshall et al. 
2006). 



However, the PSC can be compared quantitatively 
with the Galactic model simulations only where the 
former is shown to be complete. As such, we must 
compute the faint completeness limit field by field. We 
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do so by creating magnitude histograms in each band 
and locating the faintest magnitude where the source 
counts are still increasing linearly. We use a bright 
cutoff of 10 in all three bands, as recommended in the 
online explanatory supplement In order to closely 
model the 2MASS stars, we cut the modelled stars at 
the 2MASS completeness limits and simulate the pho- 
tometric errors via an exponential function which is 
able to reproduce typical CCD photometric errors very 
well. 

3. Method 

The method used to determine the distance and ex- 
tinction to the IRDCs is based on a modified version of 
the three dimensional extinction method first described 
(120061) 



Marshall et al. 



The base of the extinction 
method remains a comparison of the stellar colours of 
the Galactic model and the 2MASS dataset, as extinc- 
tion gives rise to a colour excess. For example, for the 
J and K s bands the colour excess is: 

E(J - K s ) = (J - K s ) - (J - K s ) (1) 

where the subscript denotes intrinsic colour. This 
equation can be rewritten as: 

E( J - K s ) = (J- J Q ) - (K s - K s0 ) =Aj- A Ks (2) 

as the extinction in a given band is just the differ- 
ence between observed and unredenned apparent mag- 
nitude. Putting these relations together, we can write: 



A Ks =R JK E(J-K s ) 



(3) 



where Rjk - (l/(Aj/Ag s - 1)) is the ratio of absolute 
to relative extinction. The modifications to the basic 
method are described below. 

3.1. Genetic algorithm approach 

In iMarshall et all (2006) the authors calculate the 
extinction via the colour difference between observed 
reddened stars and simulated unreddened ones (i.e. to 
which no extinction correction has been applied). The 
distance information is then obtained by assuming that 
more distant stars are redder, as interstellar extinction 
in a monotonically growing function with distance. 
This is true for the giant star population only, so dwarf 
stars are excluded from the calculation. 



'http : //www . ipac . caltech . edu/2mass/releases/allsky/doc/sec6_5al . html 



An alternative is to turn the problem on its head. 
Instead of calculating an extinction based on colour 
differences between observed and modelled stars, one 
can generate a large number of extinction distributions 
and keep the one that results in a good fit between ob- 
served and modelled stars. An immediate advantage 
of this method is that the information from all of the 
stars is used, not just the giants. However the sheer 
number of possible solutions prevents us from apply- 
ing a "brute force" approach. We have instead ap- 
pl ied the genetic algor ithm (GA) approach described 
in lCharbonneaul (1 19951) . 

In essence the GA generates and tests different ex- 
tinction distributions towards a candidate IRDC, de- 
fined as a number of points each with a distance and 
a corresponding extinction. The extinction is thus no 
longer calculated, but is an output from the GA. For 
each generated extinction distribution, the modelled 
stars are reddened using Eqj3] and the goodness of fit 
between the observed and simulated stellar colour dis- 
tributions is calculated via a merit function, described 
below. Initially, a "population" of random solutions 
is generated. Solutions in this group with the highest 
values of the merit function are combined to produce 
"child" solutions in a new "generation" of solutions. 
This evolution is allowed to continue for at most 500 
generations after which the fittest solution found is re- 
tained. If the solution converges before then, defined 
as 25 generations with no improvement in the fittest in- 
dividual, the iteration is stopped and the fittest solution 
found up to that point is adopted. 

The use of Galactic modelling and GAs to solve for 
three dimensional extinction is an interesting alterna- 
tive to other techniques, as no single stellar population 
need be isolated, and there is no such thing as fore- 
ground contamination - all stars are used in the ex- 
tinction calculation. However these advantage comes 
at a price - high CPU use resulting in a long execu- 
tion time. Each point along a line of sight towards the 
IRDC, and contained within its defining ellipse, cor- 
responds to two parameters (distance and extinction) 
and so by reducing the number of parameters to fit, a 
substantial gain in time is possible. This is done by fix- 
ing the extinction values on a regular grid along each 
line of sight and searching only for the corresponding 
distance to each extinction value. This approach is su- 
perior to fixing the distance to a regular grid, as sharp 
rises in extinction are easier to identify. 

The extinction values along the line of sight are thus 
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Fig. 1 . — Left: Extinction distribution for an example line of sight. The observations have been simulated using the 
Galaxy model with known extinction distribution (solid line). Note that the distance error bars generally increase with 
distance. Right: Colour histogram for the J-K s colour index for the simulated observations (solid line) and the model 
adjusted with the extinction points of figure on the left. 



equally spaced: 



Aics(i) - ixdA 



(4) 



for each line of sight point ; and where 6Ak s is taken to 
be twice the minimum extinction detectable. Assum- 
ing that the minimum extinction detectable arises from 
the uncertainty in the 2MASS / - K s colour, this value 
is then, using Eq[3j 



5A Ks = 2 * 



V(A7 2 + AKp 
(Aj/A Ks ) - 1 



(5) 



where AJ and AK S are the average uncertainty in the 
magnitude of the J and K s bands, respectively. Typi- 
cally, this corresponds to 0.06 < 5Ak s ^0.1. The dis- 
tribution of photometric errors is Poissonian, requir- 
ing the use of the mean in the above equation so as to 
not allow the extinction calculation to work purely on 
noise. Also, the use of the J—K s colour index is justifi- 
able as it is the most sensitive of the three to interstellar 
extinction. 

The distance values along the line of sight are: 



n = r,_! + 5r-, i > 1 



(6) 



where r\ — and the <5r,- are the variables that are 
searched for using the GA. 

We have added two modifications to this schema. 
First of all, a crude estimate to the solution is estimated 
by finding a smooth extinction solution. The Galactic 



dust distribution in the model is approximated using 
a double exponential disc with an ad hoc local nor- 
malisation. The best local normalisation is found in 
order to minimise the x 1 difference between observed 
and modelled stellar colour distributions for the partic- 
ular line of sight. This rough solution is inserted into 
the initial population as an "alpha male", intended to 
get the GA looking in the right direction. This "alpha 
male" is generally replaced by a fitter solution within 
the first couple of generations and does not influence 
the final solution, but greatly reduces the convergence 
time. If the initial random generation of solutions con- 
tains a fitter individual than the alpha male is not used. 

Secondly we have modified one of the reproduc- 
tion functions used to create child solutions, namely 
the crossover operator. GAs encode the solutions into 
individual strings called genotypes. In our case the so- 
lution is a series of Sri values (Eqj6j which the GA 
normalises. A simple example consisting of two points 
which we describe to three decimal digits would then 
be a six digit number. The crossover operator would 
take two parent genotypes, randomly select a digit to 
perform the cut, and then simply swap the remaining 
digits between the two parents to create two new chil- 
dren. This singular cut results in a slower mixing of the 
genotypes so we have changed it to N-point crossover. 
In this scheme, the number of cuts is a random variable 
and so the children contain more diversity than in the 
simple crossover operator. This approach emphasises 
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exploration of the parameter space, with less exploita- 
tion of the fittest individual. We are not looking for 
a high precision solution as the colour distribution of 
the observations can be reproduced with a spread of 
extinction solutions (see Sect l3.3l >. Rather we want to 
avoid any secondary, local minima in the^ 2 minimisa- 
tion. 

3.2. Merit function 

For each line of sight we are comparing observed 
stars with simulated stars from the Galactic model via 
the histograms of their respective colour indices (J-K s 
and H - K s ). We calculate their difference using a 
X 2 statistic, where higher merit is associated with a 
lower value of the^ 2 statistic. However, if the Galac- 
tic model overestimates or underestimates the number 
of stars along a particular line of sight we do not want 
this to influence the extinction we measure. Consider 
a hypothetical case where the model predicts a factor 
of two too many stars. If we were to use a standard 
X 1 test between modelled and observed colour distri- 
bution, then the extinction we find could be artificially 
boosted in order to remove a number of simulated stars 
by pushing them beyond the completeness limit for the 
line of sight. This is obviously an unacceptable situa- 
tion. 

To avoid this "artificial extinction" the merit func- 
tion used is b ased on a normalised x 2 test from 



Press et all d 19921) : 

( y/NJN m n n 



X' 



+ «o 



(7) 



where n . (N ) and n mi (N m ) are the number of stars 
in the 2 th bin of the colour histogram (total number 
of stars along the line of sight) of the observations 
and model, respectively. The term involving the to- 
tal number of stars ensures that the x 2 statistic will be 
lowest when the shapes of the two histograms are the 
same, regardless of any difference in the total number 
of stars. To ensure the best fit, the J - K s and H - K s 
colour indices are adjusted simultaneously. If only one 
is used, the GA will find the optimum solution for that 
colour index, perhaps at the expense of one of the oth- 
ers. An example line of sight is presented in of the left 
hand side of FigQ] using simulated observations from 
the Galactic model. The simulated observations con- 
tain a diffuse extinction component (of 0.7Aykpc _1 ) 
along with two clouds at 1.5 and 5 kpc with extinctions 
of 0.34 and 0.57 magnitudes in the K s band respec- 
tively. The stellar density and extinctions used in the 



simulation is typical of that seen for the IRDCs. The 
histograms of the J-K s colour index are shown on the 
right side of FigQ]for the simulated observations (solid 
line) and the Galactic model reddened using the points 
shown on the left hand side of FigQ] The GA is seen to 
find the diffuse extinction as well as both clouds. The 
detection of the clouds and the determination of their 
characteristics will be discussed in section [3~4l below. 

3.3. Error estimation 

The random nature of the GA means that two subse- 
quent calculations along the same line of sight will not 
be identical. Furthermore, the Besancon model gen- 
erates simulated stars using density distributions and 
luminosity functions, meaning that generating a simu- 
lated stellar catalogue twice for the same line of sight 
will not result in the same exact solution either. How- 
ever, by rerunning the Galactic model and repeating 
the extinction calculation a number of times and tak- 
ing the mean, a more robust estimator of the extinction 
distribution can be obtained. The variation in the so- 
lutions found provide us with a measure of the uncer- 
tainty, as they provide a range of extinction solutions 
which are able to fit the model to the observations. 

Each iteration to calculate the extinction along the 
line of sight is a lengthy process and we wish to min- 
imise the number of iterations while still performing 
enough to obtain a robust mean. In order to determine 
the number of solutions which should be generated, 
we calculated the extinction for a few lines of sight 
300 times. It was found that there was no difference in 
the mean after 100 and after 300 iterations, so we as- 
sume that the method has converged at 100 iterations 
and that we have found the best solution and associated 
uncertainty. This number of iterations would require 
a very lengthy calculation for the thousands of can- 
didate IRDCs so we searched for the minimum num- 
ber of iterations which satisfactorily reproduced the re- 
sult found after 100 iterations. The results of this test 
for one line of sight is presented in Fig|2] The black 
solid line with the grey shaded area represent the mean 
and the standard deviation, respectively, after 100 iter- 
ations. From left to right, and top to bottom we display 
the results after 3, 5, 10 and 15 iterations. As can be 
seen, there is far too much variation in the mean for it- 
erations less than 15. After 15 iterations, the mean and 
standard deviations are very close to those obtained af- 
ter 100 iterations. We thus proceed using only 15 iter- 
ations. 
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Fig. 2. — Extinction along a test line of sight at I = 50° b = 0°. Each figure shows the resulting extinction after 
averaging a different number of calculations. The solid line and grey shaded area show the average and standard 
deviation after 100 runs. The diamonds show the mean and the crosses show the standard deviation after the number 
of runs showed in the top left corner. 



3.4. Cloud identification and properties 

Clouds for which we attempt to measure their ex- 
tincti on come from a ca talogue of IRDC candidates 
from dSimon et al . 2006a). In this catalogue the IRDCs 
are defined as contiguous regions of MIR extinction. 
The catalogue defines ellipses enclosing the clouds 
and supplies their position, major and minor axes, po- 
sition angle as well as peak contrast and area. For 
each cloud we fetch the 2MASS observations within 
the ellipse. All clouds which contain over 50 ob- 
served stars in J, H and K s are retained. We require 
all three magnitudes in order to calculate the vari- 
ous colour indices. There is a large spread in stellar 
colours towards the dark clouds and we require at least 
five stars in each colour histogram bin to calculate a 



meaningful x statistic. The lower bound of 50 stars 
is chosen somewhat arbitrarily in order to ensure the 
colour histograms have a sufficient number of stars per 
bin. The line of sight extinction towards this ellipse 
is t hen calculated. In Fi g|3] an example ellipse from 
the ISimon etaL ( 2006ah catalogue is overplotted on 
2MASS data (top) and GLIMPSE mid infrared data 
(bottom). The number of 2MASS stars detected is 
lower within the ellipse, and the colours of the stars 
are limited to blue (foreground) and heavily reddened 
(background). The GLIMPSE data shows a substan- 
tial decrease of the 8 yu diffuse emission within the 
ellipse. However the number of point source detec- 
tions, within completeness limits, is nearly three times 
higher in the 3.6 jjm and 4.5 fim IRAC bands than in 
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Fig. 3. — Infrared observations towards MSXDC 
G028.37+00.07 (ellipse). Top: Near infrared obser- 
vations from 2MASS. Bottom: Mid infrared observa- 
tions from Spitzer / GLIMPSE. 



the three 2MASS bands. This opens up the possibility 
of using these longer wavelength observations to con- 
duct a similar study in the future. 

To determine the extinction of and distance to an 
individual cloud, we calculate the derivative of the ex- 
tinction with respect to distance along the line of sight 
(i.e. dAxs Idr). This, in effect, gives us the space den- 
sity of absorbing matter which in our case is dominated 
by large dust grains (sizes typically of the order of 1 



t"12.5 




Distance [ kpc ] 



Fig. 4. — Example of cloud detection for MSXDC 
G028. 37+00.07. The diamonds and error bars are as 
for FigQ] The solid line is the density of absorbing 
matter (dAzs/dr), the dashed line is the diffuse extinc- 
tion and the dotted lines show the diffuse component 
+ 1 and 3 cr. A cloud has been detected at ~ 5.3 kpc 
with an extinction of ~ 0.9 mag in the K s band. 



lira). The value of dAf:s Idr can be converted to n# (in 
total hydrogen, in atomic or molecular form, per cubic 
centimetre) by assuming a conversion factor between 
extinction and hydrogen column density. However as 
this requires us to assume such a value, and as it is just 
a multiplicative factor, we continue to work in units of 
mag kpc -1 in the K s band. 

The line of sight extinction will be the sum of a 
diffuse extinction upon which is superimposed one or 
more clouds. To simplify, we modelize this diffuse 
component as a constant extinction per unit distance, 
which we choose to be that which minimises the mean 
absolute deviation (which we will denote cr in the fol- 
lowing) of the line of sight density from the diffuse 
component. As clouds are present along the line of 
sight, using all the points will of course overestimate 
the diffuse component. We therefore repeat the calcu- 
lation by omitting all points with a density above 3cr. 
This is repeated until the fractional change of diffuse 
component and cr are less than 2%. 

All peaks in dA^s/dr over 3cr above the diffuse 
component are then considered as a cloud. The ex- 
tent of the cloud is found by finding the adjoining 
points which remain 1 cr above the diffuse component. 
The extinction of the cloud is then the increase in 
the extinction over the cloud minus the diffuse com- 
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Fig. 5 . — Distribution of the IRDCs in the plane of the Galaxy. The dashed lines are the spiral arms from I Valilel 
d2008l) . the local Orion spur and the Galactic bar. The dotted lines show the Galactocentric distances at 3 and 6 kpc. 
The IRDCs are seen to be concentrated in the molecular ring (3 < R < 6 kpc) and along the spiral arms. 



ponent. Only clouds with an extinction over Ak s <: 
0.114 are considered, as this threshold corresponds 
to the density necessary for clou ds to be dominated 
by molecular gas (i . e. A v > 1 ISpitzer et al.l Il973 



tion law of Cardelli et al 



Binney & Merrifield 1998), if we assume the extinc 



1989). If more than one 



cloud is found along the line of sight, the one with the 
largest average extinction is retained as the IRDC. An 
example line of sight is presented in Fig0] 

Once identified, the distance to the IRDC is simply 
the density weighted average over the cloud peak: 



>"irdc = 



K'=L„ r i x (dA Ks /dr)i 
El™. {dA Ks ldr)i 



(8) 



The extinction is converted to a column density us- 
ing the empirical relation between the extinction in the 



/ band, A j, and Nh found bv lVuong et al.l (2003) in p 
Oph dark cloud: 



N H = 5.57 x 10 21 A y cm" 2 mag" 



(9) 



where Nh is the column density of hydrogen derived 
from observations of x-ray absorption. We have cho 



sen th is value over the more well known Bohli n et al. 
d 1978b relation (N H = 1.9xl0 21 A y cm" 2 mag" 1 ) as the 



Vuong et al. (2003) result was obtained in a dense star 
forming region, more similar to the environment that 
we expect to find in IRDCs. It should be noted, how- 
ever, that the impact on our mass determinations be- 
tween the two is only around ~ 20%. As our extinction 



results are in the Ks ba nd, we adapt t he Vuon g et al. 
d2003l) relation using the lCardelli et al.l d 1989b extinc- 
tion curve (A v : Aj : A Ks = 1 : 0.282 : 0.1 14). 

In order to transform this into a mass, the column 
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density is multiplied by the area of the c loud and a 
multiplicative factor of 1.36 dAllenl l 19731) is applied 
to take into account the presence of Helium and other 
heavy elements. The mass of a cloud with area Eirdc 
pc 2 is then : 



MlRDC 



150 x 



mag 



/ ^IRDC \ 

\ PC 2 V 



(10) 



This method assumes that the cloud covers the en- 



tire ellipse defined in the ISimon et al.l (l2006al) IRDC 
candidate catalogue. As this is not always the case, 
this supposition may result in the mass being overes- 
timated. On the other hand, if parts of the cloud com- 
pletely obscure the NIR light emitted by background 
stars then the cloud mass will be underestimated. Nev- 
ertheless, the resulting mass is expected to be accurate 
to within a factor of a few. 

4. Results 

The present method has been applied to over 1500 
IRDC candidates (using the selection described above) 
and we have detected 1259 of them with an Ay > 1. 
The use of the genetic algorithm allows us to apply 
the technique to a l ower number o f star s than the 
method published in Marshall et al.l (|2006), allowing 
us to probe smaller and denser clouds. As a result, the 
current method results in a factor of four more cloud 
detections than the older method. Due to the number 
of detections, the full table of the IRDC characteris- 
tics will only be available electronically, via the VizieR 
servic^l, but an example of the results is shown in ta- 
ble 1. The distribution of the detected IRDCs in the 
plane of the Galaxy is displayed i n Fig|5] The overlaid 
spiral arms are from Vallee (2008), with the addition of 
the local Orion spur. The bar angl e plotted here is 20°, 
in agreement with recent values iGerhard d2002h . This 
spiral structure is not constrained by the distribution of 
IRDCs presented here, but is overplotted for compari- 
son. The clouds are seen to preferentially lie along the 
spiral arms of the Galaxy, as well as in the molecular 
ring (3 < R < 6). Also, a concentration of IRDCs is 
visible at the end of the near side of the Galactic bar. 
These regions are all associated with high rates of star 
formation, reinforcing the idea that IRDCs are the pre- 
cursors to massive stars. 

The distribution of the clouds in Galactocentric dis- 
tance is shown in Fig|6] Few clouds are seen to lie 



within 3 kpc of the centre. The bulk of the clouds are 
concentrated between 3 and 6 kpc, corresponding to 
the molecular ring. This is reflected in both the 1st and 
4th quadrant IRDCs, however there is an asymmetry 
at R — 6 kpc due to the presence of the Carina arm. 
Furthermore, the local Orion spur is present in the 1st 
quadrant only, creating a peak at R ~ 8.5 kpc. There 
are more IRDCs in the 4th quadrant, however this may 
just be due to the presence of more spiral arm tangents 
in this quadrant, making the detection of IRDCs easier. 

The range of masses of the detected IRDCs run 
from just over ten solar masses up to 8.7 x 1O 4 M , 
with an average mass of ~ 3.5 x 10 3 M Q . Defining their 
size as the radius of a circle with the same area as the 
IRDC ellipse, the average size of the IRDCs is ~ 7pc 
with the size ranging from a minimum of 0.9 to a max- 
imum of 64 pc. The average extinction is Ak s ~ 0.35 
mag corresponding to a total hydrogen column den- 
sity of ~ 5 x 10 2I cirT 2 while the maximum extinction 
detected is Ak s = 1-32 mag or ~ 2 x 10 22 crrT 2 . As 



pointed out by Sim on et al.l (12006bl) . this range places 
IRDCs somewhere between the larger, more diffuse 
giant molecular clouds, and the smaller, denser Bok 
globules. 

The mass spectrum of the IRDCs is estimated 
by binning the mass distribution into logarithmically 
spaced bins. The mass spectrum is then calculated 
from the number of clouds in each bin divided by the 
size of the bin : 



J dM AMi 



(11) 



http://vizier.u-strasbg.fr/viz-bin/VizieR 



where /V, is the number of clouds in bin i and AM, 
is the size of the / th bin. Mass spectra constructed 
using binned masses can introduce systematic errors 
as the slope is sensitive to the choice of the bin size. 
However, for samples with /V > 500 the resulting 
uncertainty in the spectral index is lower than 0.1 
jRosolowskyll2005l) . 

The binned mass spectrum of the IRDCs is dis- 
played in Fig|7] The errors shown do not take into 
account errors in the mass estimation, and are sim- 
ply counting errors : <x = V^V/AM. Small clouds 
are undersampled, due to the relatively large amount 
of stars necessary to apply the 3D extinction tech- 
nique, and to the fact that large clouds are much easier 
to detect at large distances. For masses greater than 
~ 1.7 x 10 3 M@, the mass spectrum is seen to fol- 
low a power law (f(M) oc M 7 ). A straight line fit 
to the spectrum for M > 1.7 x 1O 3 M yields a spec- 
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Name 


/ 


b 


Distance 


A Ks 


Size 


Mass 


Nobs 


*? 


cr 




o 


o 


kpc 


mag 


pc 


M 








G025.65+00.55 


25.6550 


0.5590 


3.92 


0.580 


8.94 


5768 


381 


2.43 


51.55 


G025 .66-00. 12 


25.6690 


-0.1260 


4.69 


0.160 


4.57 


416 


60 


0.60 


14.11 


G025.72-00.28 


25.7240 


-0.2870 


4.85 


0.200 


4.64 


535 


65 


0.80 


32.41 


G025. 74-00.44 


25.7450 


-0.4410 


4.35 


0.180 


13.43 


4044 


726 


3.11 


9.90 


G025.79+00.81 


25.7990 


0.8150 


4.10 


1.040 


5.96 


4594 


116 


1.26 


67.43 


G025.85+00.49 


25.8530 


0.4950 


3.98 


0.900 


6.07 


4134 


102 


1.01 


439.74 


G025.90+00.33 


25.9060 


0.3350 


4.29 


0.470 


5.17 


1563 


100 


0.68 


41.12 


G025.92+00.26 


25.9260 


0.2660 


7.06 


0.170 


9.38 


1864 


92 


0.67 


3.79 


G025 .94+00.63 


25.9420 


0.6390 


2.63 


0.280 


3.09 


332 


134 


0.94 


11.34 


G026.18+00.14 


26.1900 


0.1500 


5.07 


0.460 


5.20 


1547 


74 


0.97 


22.18 



Table 1: Example of the results for the IRDC characteristics. For each cloud is listed, in order, it's name, Galactic 
coordinates / and b, distance, extinction, size, mass, the number of 2MASS stars used in extinction calculation, the 
reduced^ 2 (Eq.4) and the number of standard deviations the density of the cloud sits above the background (see Fig. 3). 
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Fig. 6. — Distribution of Galactocentric distances of 
IRDCs: Dotted line (all clouds), dashed line(l st quad- 
rant) and solid line (4 th quadrant). The peak occurs in 
the molecular ring. 



Fig. 7. — Mass spectrum of the IRDCs. After ~ 
1.7 x 10 3 M o , the spectrum is well fit by a power law 
(solid line) with a possible break at high mass. The two 
vertical lines delimit the zone where the cloud sample 
is deemed to be complete. 



tral index of y = -1.75 ± 0.06 (solid line in Fig0 
steeper than report ed values for molecular clouds in 



the in ner Galaxy (ISolomon et alJ 1987; Rosolowskv 



2005) and flatter than the slope found bv lSimon et al. 
(2006b) for their IRDC selection. The data seem to 



suggest that at the highest masses there is a break in 
the power law. However the average number of clouds 
per bin at these masses is just over 2, so it is not pos- 
sible to reach any firm conclusion. These points (to 
the right of the second vertical line in FigO have not 
been used in the linear fit One possible explanation to 
the high mass cutoff seen may be high mass clouds 
being mistakenly identified as several smaller mass 
clouds in IRDC input catalogue, an effect mentioned 



by lRosolowskyl (120051) . In order to tackle this problem, 
the IRDCs could be characterised using higher resolu- 
tion MIR observations such as the GLIMPSE survey, 
which would enable a more robust characterisation of 
the IRDCs. 



Hennebelle & Chabrier (2008) examine the mass 



spectral index y of a cloud population governed by 
density fluctuations induced by turbulence. They find 
that y = -2 + (n' — 3)/3 where n' is the power spec- 
trum index of log(p). They also mention that simula- 
tions show n' to be close to the Kolmogorov turbu- 
lence index, or n' ~ 11/3. This results in a mass 
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spectral index of y ~ -1.77 - very close to what we 
find. This suggests that the IRDCs in our sample are 
not virialised but are the result of density fluctuations 
induced by turbulence. This observation does not ex- 
clude the presence of higher density, autogravitating 
regions within the individual clouds in our IRDC se- 
lection. These higher density regions could in fact be 
completely missed by our method as the column den- 
sity of the dust would be too high to observe back- 
ground stars in the near infrared and at the 2MASS 
completeness limit. 

5. Discussion 

5.1. Limits of NIR extinction 

The main difficulties in using the present method to 
obtain information on the IRDCs are the small sizes 
and large densities of the clouds. The technique we 
employ requires the detection of many stars along the 
line of sight to obtain enough information on the 3D 
extinction. This rules out small clouds and clouds 
so dense that too few stars are detected behind them. 
Thus, for our initial selection, we start by rejecting all 
clouds that lie along lines of sight having fewer than 
50 stars detected in the three 2MASS bands. 

The completeness of the 2MASS catalogue is lim- 
ited by source confusion, and is a problem in the 
Galactic plane and towards the Galactic bulge. How- 
ever, as we are concentrating on high extinction fea- 
tures, source confusion is less of a problem and the 
2MASS completeness limit is less severe. In fact, the 
average completeness in three bands of our IRDC sam- 
ple is 15.2, 13.9 and 13.1 for the /, H, and Ks bands, 
respectively. As we use a bright cutoff at magnitude 
10, the largest J - K s observable is 5.2, which results 
in a maximum extinction of ~ 3.5 in the K s band and a 



maximum column density of 5.1 x 10 cm , from Eq. 
[9] The minimum extinction detectable results from the 
colour uncertainty in the 2MASS dataset, and corre- 
sponds to about 7 x 10 20 cm -2 . This range of den- 
sities is compatible with the typical de nsities found 
for IR DCs through 13 CO observations dSimon et al 
20063). 



5.2. Comparison with existing measures 



Simon et al. (2006b) measured the CO line to- 



wards over 300 candidate IRDCs in the first Galactic 
quadrant. They used the radial velocity channels of 
the CO observations that matched the extinction fea- 



tures in the MSX data dSimon et alj|2006al) . They then 
attributed a kinematic distance to the cloud by assum- 
ing circular motion and using the IClemensI (11985b ro- 
tation curve. They solved the inner Galaxy distance 
ambiguity by assuming that all clouds are at their near 
kinematic distance as the clouds are seen in extinction 
and therefore must lie in front of the Galactic back- 
ground emission. The masses were calculated from the 
CO emission and by assuming an H2 to CO conversion 
factor. 

We have detected 107 of their sample using the 
present method - the other clouds in their study ei- 
ther have too few 2MASS stars detected towards them 
(> 75% of their sample) or do not present an extinc- 
tion feature with Ay > 1. For a number of their lines 
of sight, they include distances for two or more line 
of sight clouds; we take the highest density cloud to 
compare with our estimate. The different results for 
the distances of this IRDC sample are compared on the 
left side of Fig fS] The black solid line shows equality 
and the dotted lines are placed at +2 kpc from equality. 
Although there is appreciable scatter, there is general 
agreement for most of the points. Clouds for which 
extinction distances differ greatly from the kinematic 
distances may, in fact, be different clouds. As we have 
not mapped out the cloud, we are not able to morpho- 
logically match the e xtinction in the NIR to the MIR 
extinction features as ISimon etal . (2006b) do. On the 
other hand, some of their distances estimates may be 
wrong due to their systematic choice of the near kine- 
matic distance. On the right hand side of the FigfS]is 
the histogram of the difference between the two dis- 
tance determinations. Here we can see that over 80% 
of the distance determinations are within 2 kpc of each 
other. There is only a slight systematic offset of ~0.5 
kpc. 

Jackson et a l. (2008) obt ained distan ces for IRDC 



Sim on et al. ( 2006ah sample in the 



candidates from the 
fourth quadrant using observations of the CS molecule. 
This molecule requires high density for excitation 
making it a good tracer of dense clouds. Like us, 
they do not map the clouds but search for the highest 
density feature along the line of sight. In the centre 
of Fig fS] we compare our distance measurements to 
theirs, for the 95 lines of sight where both methods de- 
tect a cloud. For a number of their lines of sight, they 
include distances for two line of sight clouds; we take 
the higher density cloud to compare with our estimate. 
There is a systematic offset of around 1.5 kpc between 
the two methods (FigJH] right), with just a few points 
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Fig. 8. — Left : Scatter plot of extinction distances with the kinematic distances listed in lSimon et al. I d2006bl) . The 
black line shows equality a nd the dotted lines ar e at +2 kpc. Middle : Comparison of extinction distances with the 
kinematic distances listed in Jack son et alJd2008l). R ight : Histogra m of differences betw een extinction distances and 
kinematic distances as found bv lSimon et al.1 d2006bl) (solid line) and Jackson et al. (2008) (broken line). A significant 
~ 1.5 kpc systematic difference is found with the distances reported by Jackson et alJ (120081) . 



differing by much more. As mentioned above, these 
outliers may be due to different line of sight clouds 
being detected with the two methods or may be due to 
th e systematic choosing of the near kinematic distance 
bv lJackson eTal] d2008l) . 

As the mass depends on the square of the distance, 
the latter is key in determining accurate masses. In 
Fig. |9]we compare o ur mass e stimates with tho s e from 
Simon et alJ d2006bl) (+) and iRathborne etall (2006) 
(o). Note that Fig. [9] is a log-log plot. There is ap- 
preciably more scatter here compared to that observed 
for the distances. Most mass estimates agree within 
a factor of te n and our mass estimat es are generally 
larger than the Rat hborne et al. (2006) estimates. Con- 
sidering the uncertainties in the three methods, an or- 
der of magnitude difference between methods is to be 
expected as the techniques used trace different parts of 
the IRDCs. 

CO observations of IRDCs are a good tracer of the 
lower density envelope, but may be optically thick to- 
wards the denser cores. Also, the low temperatures 
in the cores may cause the CO molecules to form ice 
mantles on the dust grains dAlves et al.l [T999). Obser- 
vations of the dust mm-continuum is optically thin, 
even at the very high column densities of the IRDC 
cloud cores. On the other hand, current mm obser- 
vations are more sensitive to the small scale structure 
but mi ss out much of the ex tended emission of the 
cloud ( Bergin & Tafallal 2007 ). Thus it is not sur- 
prising that our mass estimates are closer to those of 



Simo n et al. (2006b), as our method will also miss out 
the dense cores while recovering the lo wer density en 



velope , and that they are higher than IRathborne et al. 



(2006) as most of the mass of da rk clouds is con- 
tained in the lower density envelope ( Alves et alT l 999 ; 



Cambresv et al .1 120021; iLombardi et all 2006). 



By c o mparing their r e sults w ith those from lSimon et al 



(2006b), 



Jackson et al.1 (120081) report an asymmetry 
in the distribution of 1 st quadrant IRDCs compared 
to 4 th ones. Our analysis does not show this asym- 
metry, however we find that the kinematic distances 
in the first quadrant have less of an offset to the ex- 
tinction distances than the fourth quadrant IRDCs. 
This may be evidence of a difference in the veloc- 
ity field between the two quadrants, or may be due 
to a large scale stellar asymmetry not present in the 
Galactic model. Indeed, results from the Galactic 
Legacy Mid-Plane S urvey Extraordinaire (GLIMPSE 
iBenjamin et al]l2005l) show stellar overdensities along 
a number of lines of sight crossing spiral arm tan- 
gencies. However these are present in both the first 
and fourth quadrants with similar intensities, and so 
they do not seem to be responsible for introducing 
such a large bias in our measurements. Furthermore, 
a number of precise parallax measurements of star 
forming regions using the NRAO Very Long Base- 
line Array (VLBA) show very good agreement with 
the kinematic distances towar ds massive star forming 



2008; 



region s in the fi rst quadrant (Brunthal er et al. 
Zhang et alJ 120081) . although, to our knowledge, no 
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Fig. 9. — Comparison of our mass estimates 
with those from Isimon et all ( 2006b Xplus signs) and 
Rathb orne et al.1 d2006l) (diamonds). 



similar measurements exist for the fourth quadrant. 
Kinematic distances in the region studied have uncer- 
tainties of 0.5 -1.5 kpc and occasionally up to 3 kpc 
dG6mezl l2006) due to non-circular motions associated 
with the spiral arm shocks. The distance obtained from 
the 3D extinction has an uncertainty of ~ 0.5 - 1 . kpc. 
However, the selection of IRDCs seen in extinction 
may be polluted by other line of sight clouds. 



5.3. Galactic spiral structure 

The distribution of IRDCs shown in Fig. |5]is seen to 
follow the spiral structure of the Galaxy. However, de- 
partures are to be seen, especially between the Norma 
and Scutum arms at Galactocentric distances of be- 
tween 4 and 5 k pc. This distance match e s that of the 
Mole cular ring dTavlor & Cordesl 1 19931: iDame et all 
2001) which may be a ring like structure or may be 
the result of current angular resolution being u nable 
to res olve multiple spiral arms in this region dValled 
2008). The spiral pattern traced out by the IRDCs in 
Fig. [5]could be explained by either an actual molecu- 
lar ring or the bifurcation of one arm into two. Kinks 
and wiggles are present in every external spiral galaxy 
observed, so we are not to expect perfect logarithmic 
spirals from our own host Galaxy. The current re- 
sults, however, are not enough to refute or confirm the 
molecular ring hypothesis, as there may be some (as 
yet) unknown bias affecting the results in the 4th quad- 
rant. 



The current catalogue of IRDC candidates dSimon et al 
2006a) is not a complete survey of cold dark, "IRDC- 
like", clouds in the Galaxy. Indeed, it has been com- 
piled with a severe bias - as the clouds are seen in ex- 
tinction in the MIR, they require a bright background 
to be detected. As such they are almost exclusively 
on the near side of the Galaxy and are concentrated at 
very low Galactic latitudes. With the recent launch of 
Planck, a full sky survey of sub-millimetre emission 
from cold dust will soon be accessible. These obser- 
vations will provide an unbiased survey of the cold 
cores which are known to be associated with IRDCs. 
The present 3D extinction method could be adapted 
to probe these higher density cores by utilising stellar 
MIR observations, such as the GLIMPSE survey (see 
FigI3, or the upcoming all sky MIR surv ey "Wide 
field Infrared Survey Explorer" 
2004 . 



(WISE, Duval et al. 



Compared to kinematic distances, the 3D extinc- 
tion method has the advantage of providing tempera- 
ture independent measures of the dust column density, 
as well as avoiding the near - far distance ambiguity 
which plagues kinematic distance estimates within the 
solar circle. By using the MIR observations mentioned 
previously, along with the deep NIR observations of 
the Galactic Plane from the "UKIRT Infrared Digital 
Sky Survey (Gala ctic Plane Survey)" (UKIDSS(GPS), 
Lucas et al J 120081) . it will be possible to map out the 
spiral structure of our Galaxy to the far side of the so- 
lar circle and perhaps beyond. 

6. Conclusions 

We have applied a new three dimensional extinc- 
tion technique towards over 1500 IRDC cloud candi- 
dates, for which we recovered distances and masses 
for 1259 of them, including over 1000 previously un- 
measured clouds. This is a much larger sample than 
previous studies, enabling the study of the cloud popu- 
lation as a whole. The spatial distribution of the clouds 
is found to be concentrated in the molecular ring and 
along the spiral arms, reinforcing the idea that IRDCs 
are the birthplace for high mass stars. Their mass spec- 
trum follows a power law with a slope of - 1 .75 + 0.06 
which is steeper than GMCs in the inner Galaxy, sim- 
ilar to CO clumps and shallower than molecular cores 
or the stellar IMF. This slope suggests that the IRDC 
population is composed of non-gravitationally bound 
clouds, and are the result of density fluctuations in- 
duced by turbulence. However, higher density auto- 
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gravitating clumps inside the individual clouds of our 
IRDC selection would most likely be missed by the 
present method. 

This new method is independent of any kinemat- 
ical information, thus providing a new way to ob- 
tain information on the Galactic distribution of the 
ISM. It is a good complement to existing measures 
which are solely based on molecular gas kinematics 
as both methods are completely independent and both 
are affected by different systematics. It will be able 
to provide valuable distance information for use in 
the analysis and interpretation of far-infrared and sub- 
millimetre observations by Herschel and Planck. In 
the future it could be used with deeper stellar obser- 
vations or observations at longer wavelengths in order 
to probe even higher density clouds and to even larger 
distances. 
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